Ontology Development for ETL Process Design

نویسنده

  • Mohd Syazwan Abdullah
چکیده

The Extract, Transform, Load (ETL) process design is difficult to perform because of the ambiguity of user requirements and the complexity of data integration and transformation. Current studies have explored the ontology-based approach to overcome these limitations by reconciling the semantics of user requirements within the ETL process design for easy generation of the ETL process specification. The ontology for ETL process activities has been developed by using the Requirement Analysis Method for ETL Processes (RAMEPs) that is gathered from the perspectives of organization, decision-maker, and developer. Therefore, the ontology is used to generate the ETL process specification for a student affairs’ Data Warehouse (DW) system. The correctness of the ontology model was validated by using an appropriate reasoner. Moreover, the process of ontology development for the case study is presented and shows how the ontology-based approach was successful in implementing the design and generating the ETL process specification.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Requirements Analysis Method For Extracting-Transformation-Loading (Etl) In Data Warehouse Systems

The data warehouse (DW) system design involves several tasks such as defining the DW schemas and the ETL processes specifications, and these have been extensively studied and practiced for many years. The problems in heterogeneous data integration are still far from being resolved due to the complexity of ETL processes and the fundamental problems of data conflicts in information sharing enviro...

متن کامل

Ontology-Driven Conceptual Design of ETL Processes Using Graph Transformations

One of the main tasks during the early steps of a data warehouse project is the identification of the appropriate transformations and the specification of inter-schema mappings from the source to the target data stores. This is a challenging task, requiring firstly the semantic and secondly the structural reconciliation of the information provided by the available sources. This task is a part o...

متن کامل

Flexible and Customizable NL Representation of Requirements for ETL processes

The design of an Extract – Transform – Load (ETL) workflow for the population of a Data Warehouse is a complex and challenging procedure. In previous work, we have presented an ontology-based approach to facilitate the conceptual design of an ETL scenario. In this paper, we elaborate on this work, by investigating the application of Natural Language (NL) techniques to the ETL environment and we...

متن کامل

Rameps: a Goal-ontology Approach to Analyse the Requirements for Data Warehouse Systems

The data warehouse (DW) systems design involves several tasks such as defining the DW schemas and the ETL processes specifications, and these have been extensively studied and practiced for many years. However, the problems in heterogeneous data integration are still far from being resolved due to the complexity of ETL processes and the fundamental problems of data conflicts in information shar...

متن کامل

A BPMN-Based Design and Maintenance Framework for ETL Processes

Business Intelligence (BI) applications require the design, implementation, and maintenance of processes that extract, transform, and load suitable data for analysis. The development of these processes (known as ETL) is an inherently complex problem that is typically costly and time consuming. In a previous work, we have proposed a vendor-independent language for reducing the design complexity ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015